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Abstract— When high-speed media orchannels are 
used, high-speed data compression is desired. Software 
implementations are often not fast enough. In this paper, 
we present the very high speed hardware description 
language (VHDL) modeling environment of Lempel-Ziv- 
Welch (LZW) algorithm for binary data compression to 
ease the description, verification, simulation and 
hardware realization. The VHDL model defines a main 
block, which describe the LZW algorithm for binary 
data compression through a behavioral and structural 
description. The LZW algorithm for binary data 
compression comprises of two modules compressor and 
decompressor. The input of compressor is 1-bit bit stream 
ad in according to the clock eycle. The output is an 8-bit 
teger stream fed into the decompressor, which is an 
index that represents the memory location of the bit 
string stored in the dictionary. The output of 
decompressor is 1-bit bit stream, Once detecting the 
particular approaches for input, output, main block and 
different modules, the VHDL descriptions are run 
through a VHDL simulator, followed by the timing 
analysis for the validation, functionality and 
performance of the designated model that supports the 
effectiveness of the model for the application. 


Keywords— Binary Data Compression, LZW, VHDL, 
Simulation 


1. INTRODUCTION 


Despite the fact that computer memory costs have 
decreased dramatically over the past few years, data 
storage still remains, and will probably always remain, 
an important cost factor for many large scale datebage 
applications [1-3]. Compressing data in a database 
system is attractive for two reasons: data storage 
reduction and performance improvement. Storage 
reduction is a direct and obvious benefit, while 
performance improves because smaller amounts of 
physical data need to be moved for any particular 
operation on the database [4-6] One of the most 
important developments in the study of data 
compression is the modern paradigm first presented by 
Rissanen and Langdon [7]. This paradigm divides the 
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process of compression into two separate components: 
modeling and coding. Modeling is the process of 
constructing representation. Coding entails mapping the 
modeler's representation of the source into a 
compressed representation. The process of converting a 
text, such asa file on disk, a string in memory, ora stream 
alled 


of characters, into a compact representation 
encoding or compression. Decoding or decompression 
restores the original text from its compressed 
representation [8-9]. 


Huffman coding is constrained to represent every 
event using an integral number of bits [10-11]. 
Updating a Huffman tree is much more time 
consuming. Like Huffman, Shannon-Fano coding 
requires to recognize the statistical properties of the 


source file ie. the probability of occurrence of a code 


must be found first before the compression start [12- 
13]. But in most cases, it is not possible to determine 
the statistical propertics of the file due to the 
randomness of the occurrence 


Dictionary compression (also referred to as Ziv- 
Lempel compression or textual substitution) removes 
data redundancy by replacing repeated input substrings 


by references (also called indices or pointers) to earlier 
copies of the identical substring [14-17]. A dictionary of 
characters, words or phrases that are expected to occur 
frequently is maintained and a recurring substring is 
encoded by the index of its corresponding dictionary 
entry. Compression is achieved by choosing indices so 
that on average they require less space than the phrase 
they encode. 


LZW compression implements a variation of LZ78 
due to Welch and initializes the dictionary with the input 
character set [18]. LZW compression is a lossless, 
adaptive and dynamic dictionary compression technique 
[19-20]. LZW compress text, executable code, and 
similar data files to about one-half their original size. 
LZW also performs well when presented with extremely 
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redundant data files, such as tabulated numbers. 
computer source code, and acquired signals. 


Compression ratios of 5:1 are common for these cases 


In this paper we present a model of LZW algorithm 
for binary data using VHDL. The goal of this work is to 
ease hardware realization to achieve high-speed data 
compression and to evaluate the feasibility of using 
VHDL for rapid design and prototyping of the same 


The use of VHDL for modeling is especially 
appealing since it provides a formal description of the 
system and allows the use of specific description styles 
to cover the different abstraction levels (architectural, 
register transfer and logic level) employed in the design 
[21-23]. In the computation of method, the problem is 
first divided into small pieces, each can be seen as a 
submodule in VHDL Following the software 
verification of each submodule — the synthesis is then 
activated. It performs the translations of hardware 
description language code into an equivalent netlist of 
digital cells. The synthesis helps integrate the design 
work and provides a higher feasibility to explore a far 
wider range of architectural alternative [24-27]. The 
method provides a systematic approach for hardware 
realization, facilitating the rapid prototyping of the 
energy meter. 


Il, MATERIALS AND METHODS, 


This research is to develop a model of LZW 
algorithm for binary data compression using VHDL. 
The model is capable of compressing |-bit bit stream 
of binary data. 


The application of the model can be used in various 
data compression devices thus saving memory 
space and reduce the transmission time. A structural 
view ofthe model is shown in Fig. 1 


The input of compressor is. a bit streams, 
where binary data is read in one by one according to the 
clock cycle. The input bit stream is compressed, or 
encoded to another form. The output is an index that 
represents the memory location of the bit string stored 
in the dictionary called a codeword. The output of 
compression is an integer stream, which consists ofa 
predefined number of bits. The number of bits used 
in the compression determines the number of memory 
locations allocated in the dictionary. The selection of 
number of bits is important as the memory locations 


are filled up before all the input bits are read if the 
number of memory locations allocated is not enough 
This make the dictionary “congested” and as a result 
the compression cease. The number of bits used is 
directly proportional to the file size to be compressed. 
Unlike design in software compressor, the software is 
able to check the file size and allocate more bits 
for bigger file size, Since the compressor is hardware. 
where the compressor itself unable to check the file 
size, the number of bits used is predefined. In this 


research, 8 bits are used to design an 8-bit compressor 
thus the memory locations allocated is 28=256 
memory locations. 
Bbt 
top ison Outre 
stream 


Compressor 


Fig. 1: Structural view of VHDL model 


Since the decompressor decompress the output of 
compression, the compression output is used as the 


input for decompression. The decompress or adds a 


new string to the dictionary each time it reads a new 
code. It translates each incoming code into a string and 
sends it to the output. The input tothe decompressor is 
an integer stream consists of 8 bits. The output of the 
decompression is a bit streams. The desired output of 
decompression is binary data, which is same as the 


original binary data. 


Three control signals, clock, selector and output 
select control the operation. Clock signal is used to 
control reset, compression, or decompression Table | 
shows the function and tts corresponding control signal. 


TABLE 1 
Selector And Its Corresponding Funetion 
x Y FUNCTION 
0 0 RESET | 
0 1 COMPRESSION | 
| 0 DECOMPRESSION 
1 1 UNUSED 


The last control signal is output select. When 
output select is “1”, the output is enabled and the 
compressor output the current value of the output. 
whether it is in compression or decompression, This 
signal is necessary because when the compressor 
searches the dictionary to find whether the input 
string is in the dictionary, the searched result is either 
found or not found. If the string is not found, the 
compressor input next byte, concatenates it with the 
previous string, and searches the dictionary again. 
Hence, the compressor does not have an output after 
every new cycle, The output is only enabled when a 
string is not found in the dictionary. 


VHDL language is able to design in hierarchies. 
Reusing components, error management and 
verification allows describing complex circuitry 
efficiently. In VHDL model, three packages std_ 
logic_1164, sid_logic arith and std_logic_ unsigned 
are defined from the IEEE library. These packages 
include definition of standard logic data types, 
arithmetic operations involving standard logic data 
types, integer data types, integer to standard logic 
vector conversion and vice versa. The standard logic 
is the type declaration for bit stream and standard 
logic vector is the type declaration for integer stream. 


The VHDL design entity for the LZW algorithm 
for binary data compression is built by constructing a 
number of entities with certain behavior associated 
with them. The modeling styles of entities that are 
being used: structural, dataflow, and behavioral. 
The model is basically consists of 4 main parts, i.e. 
entity declaration, architecture declaration, signal 
declaration, and clock and control setting. 


The entity declaration specifies the name of the 
entity being modeled. All the interface ports, which 
include input and output, are declared in the entity 
declaration, The entity name of the model is Izw. 
There are 7 interface ports that are declared which 
contributes a total of 22 input and output. Bit stream 
is the input of I-bit bit stream for compression. 
Integer stream consists of 8-bit for compression 
output. Integer_stream_de consists of 8-bit for 
decompression input and Bit_stream_de is the output 
of 1-bit bit stream. The following VHDL code is the 
entity Izw declaration 


entity law is 
port ( 
Bit stream: in STD LOGIC: 
Integer_stream: out STD_LOGIC_VECTOR (7downto 0); 
Integer_stream_dec in STD_LOGIC_VECTOR (7downto 0); 
Bit_stream_de: out STD_LOGIC: 
output_sel: out STD_LOGICE: 
clock: in STD_LOGIC: 
Selector: in STD_LOGIC_VECTOR (Idownto 0) 


% 
end lew 


Architecture defines a body for a component entity. 
An architecture body specifies the behavior between 
inputs and outputs. The architecture name is not the same 
as the component name. The architecture name of the 
model is Izw_archi. Architecture Izw_archi is tied to 
entity Izw. 


All the signals are declared and also the type of 
variable is defined. A partial signal declaration VHDL 
code is given below. 


variable clement 
variable counter 
variable ML_max 
variable ele_num 
variable found 


variable ML. 2 array256; 
variable ocode : integer: 
variable node : integers 
variable ML de —: array2S6slv; 


variable str 
variable hitent 
variable endpro 


: std_logic_veetor(7 downto 0); 
: integer: 
sinteger :=0; 


begin 
process(clock) 


begin 


if (clock'event and clock="I')then 


if(selector="00")then--resee case 
elsif (selector="01")then-compression 
elsif (selector +" 10" jthen-decompression 


end if: 
end if; 


The program starts its execution with process 
“clock” high. The control signal “selector” has three 
functions run at any one time. These unctions are reset. 
compression and decompression. The VHDL code 
formation of clock and control setting is given below. 
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Ill. RESULTS AND DISCUSSIONS 


A test bench is written to perform simulation by 
adding the stimulus and defining the clock cycle to 
verify the correctness of the model 


Apart of the VHDL code of stimulus is given below. 
At the initialization of the simulation, the selector is set 
to “00” to reset the operation, When the time run until 
210ns, the selector selects either compression or 
decompression depends on which simulation is desire 


selector ="00" after Ons, "01" 210ns; 
clock<=not(clock) after 100 ns; 


Bit_stream<="T' after 210ns '0' after 410ns, '0' after 610ns, I! after 810ns 


The simulation is performed using the string of 50 
binary bits. The simulation waveform of compression 
block and decompression block is shown in Fig, 2 and 
Fig. 3 respectively and their corresponding variables 
and functions are tabulated in Table 2 and Table 3 
respectively, 


TABLE 2 
Variable and its function in compression 


Name Function 


Bit_steam Input bit stream 
Output integer stream 
Clock signal 

Output enable 
Funetion selector 


Integr_stream 
Clock 

Output sel 
Selector 


TABLE 3 
Variable and its function in decompression 


Name Funetion 


Bit_stream_de 
Integr_stream_de 


Output bit stream 
Input integer stream 


Clock Clock signal 
Output_sel Output enable 
Selector Function selector 


In Fig. 2, the clock cycle is 200 ns per cycle. The 
selector is “00” from Ons to 210ns. Selector “00” is 
the reset case. After 210 ns, the selector is set to “01” 
to switch to compression mode. Hence, the first input 
bit is read at 210 ns, and after every new cycle, the 
next bit is read. The output is enabled when 
output_sel is high, or equal to “1”. For every high 
state of output_sel, the corresponding value of 
Integer_stream is the output of compression. 
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Pig. 2: The Simulation waveform of compression block 


Input: 


1,0,0,1,0,1.1,0,1,1.0,1,1,0,0,0,1,0,0,1,0,0,1,0,1.0,0, 
1,0,1,1,1,0, 


1,0,1,1,1,0,1,1,1,0,1,0,0,1,0,1.1 
Output: 
1,0,0,2,1,5,7,3 


,10,12,7,5,4,6,17,16,13,1 


In Fig. 3, the clock cycle is 200 ns per cycle. 
The selector is “00” from Ons to 110ns. After 110 ns, 
the selector is set to “10” to switch to decompression 
mode. Hence, the first input integer is read at 110 ns 
and after every 8 clock cycles, or 1600 ns, the next 
integer is read The gap between subsequent inputs is 8 
clock cycles due to 8-bit integer. The decompressor 
read one bit in one clock cycle. Like compression, the 
output is enabled when output_sel is high, or equal to 
“I, For every high state of output_sel, the 
corresponding value of Bit_stream_de is the output of 
decompression 
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Fig. 3: Simulation waveform of decompression block. 


Input: 
1,0,0,2,1,5,7,3,4,3,2,10,12,7,5.4.6,17.16,13,1 


Output: 
1,0,0,1,0,1,1,0,1,1,0,1,1,0.0,0,1,0,0.1,0,0,1,0,1.0,0, 
1,0,1,1,1,0, 


1,0,1,1.1.0.1,1,1,0,1,0,0,1,0,1,1 


The simulation results of input and output of each 
block confirms the correctness of the code. The model is 
also compared with a C model of LZW algorithm where 
the same input is being fed and the output is synonymous 
with the result obtained from VHDL model. 


IV. CONCLUSIONS, 


In this paper, we present the very high speed 
hardware description language (VHDL) modeling 
environment of Lempel-Ziv-Welch (LZW} algorithm 
for binary data compression to ease the description, 
verification, simulation and hardware realization, By 
simulating with binary data and comparing with C model 
of LZW algorithm the proposed approach of VHDL 
modeling of LZW algorithm for binary data 
compression is successfully designed, implemented and 
tested. Currently, we are conducting the further research 
that considers further reductions in the hardware 
complexity in terms of synthesis and ease hardware 
realization. 
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